Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 1000000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 672.3 MiB |
| Average record size in memory | 705.0 B |
Variable types
| NUM | 12 |
|---|---|
| CAT | 11 |
| BOOL | 1 |
Reproduction
| Analysis started | 2020-10-16 11:16:57.187956 |
|---|---|
| Analysis finished | 2020-10-16 11:20:51.302344 |
| Duration | 3 minutes and 54.11 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
sid has a high cardinality: 2686 distinct values | High cardinality |
sdomain has a high cardinality: 2882 distinct values | High cardinality |
aid has a high cardinality: 3149 distinct values | High cardinality |
adomain has a high cardinality: 199 distinct values | High cardinality |
did has a high cardinality: 150266 distinct values | High cardinality |
dip has a high cardinality: 555865 distinct values | High cardinality |
dmodel has a high cardinality: 5150 distinct values | High cardinality |
hour is highly correlated with df_index | High correlation |
df_index is highly correlated with hour | High correlation |
E is highly correlated with B | High correlation |
B is highly correlated with E | High correlation |
df_index has unique values | Unique |
dtype has 55098 (5.5%) zeros | Zeros |
pos has 719953 (72.0%) zeros | Zeros |
| Distinct count | 1000000 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20231351.42731 |
|---|---|
| Minimum | 2 |
| Maximum | 40428890 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2023981.75 |
| Q1 | 10124574 |
| median | 20250724.5 |
| Q3 | 30332100.5 |
| 95-th percentile | 38401285.75 |
| Maximum | 40428890 |
| Range | 40428888 |
| Interquartile range (IQR) | 20207526.5 |
Descriptive statistics
| Standard deviation | 11671953.82 |
|---|---|
| Coefficient of variation (CV) | 0.5769240803 |
| Kurtosis | -1.200057968 |
| Mean | 20231351.43 |
| Median Absolute Deviation (MAD) | 10102820.5 |
| Skewness | -0.002827670346 |
| Sum | 2.023135143e+13 |
| Variance | 1.362345059e+14 |
| Value | Count | Frequency (%) | |
| 11779987 | 1 | < 0.1% | |
| 4946998 | 1 | < 0.1% | |
| 17519667 | 1 | < 0.1% | |
| 11226160 | 1 | < 0.1% | |
| 9411954 | 1 | < 0.1% | |
| 38517806 | 1 | < 0.1% | |
| 17531945 | 1 | < 0.1% | |
| 23610682 | 1 | < 0.1% | |
| 783399 | 1 | < 0.1% | |
| 9174054 | 1 | < 0.1% | |
| 36167796 | 1 | < 0.1% | |
| 32128030 | 1 | < 0.1% | |
| 21636125 | 1 | < 0.1% | |
| 23729179 | 1 | < 0.1% | |
| 32119834 | 1 | < 0.1% | |
| 2753561 | 1 | < 0.1% | |
| 20824433 | 1 | < 0.1% | |
| 10051040 | 1 | < 0.1% | |
| 36338710 | 1 | < 0.1% | |
| 26603551 | 1 | < 0.1% | |
| 2774035 | 1 | < 0.1% | |
| 34227217 | 1 | < 0.1% | |
| 12750374 | 1 | < 0.1% | |
| 27964431 | 1 | < 0.1% | |
| 38444042 | 1 | < 0.1% | |
| Other values (999975) | 999975 | > 99.9% |
| Value | Count | Frequency (%) | |
| 2 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 11 | 1 | < 0.1% | |
| 22 | 1 | < 0.1% | |
| 70 | 1 | < 0.1% | |
| 77 | 1 | < 0.1% | |
| 88 | 1 | < 0.1% | |
| 99 | 1 | < 0.1% | |
| 122 | 1 | < 0.1% | |
| 141 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 40428890 | 1 | < 0.1% | |
| 40428883 | 1 | < 0.1% | |
| 40428881 | 1 | < 0.1% | |
| 40428877 | 1 | < 0.1% | |
| 40428851 | 1 | < 0.1% | |
| 40428786 | 1 | < 0.1% | |
| 40428752 | 1 | < 0.1% | |
| 40428751 | 1 | < 0.1% | |
| 40428726 | 1 | < 0.1% | |
| 40428591 | 1 | < 0.1% |
like
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 830111 | 83.0% | |
| 1 | 169889 | 17.0% |
| Distinct count | 240 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19122558.732911 |
|---|---|
| Minimum | 19122100 |
| Maximum | 19123023 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 19122100 |
|---|---|
| 5-th percentile | 19122109 |
| Q1 | 19122304 |
| median | 19122602 |
| Q3 | 19122814 |
| 95-th percentile | 19123012 |
| Maximum | 19123023 |
| Range | 923 |
| Interquartile range (IQR) | 510 |
Descriptive statistics
| Standard deviation | 296.7519741 |
|---|---|
| Coefficient of variation (CV) | 1.551842398e-05 |
| Kurtosis | -1.336231511 |
| Mean | 19122558.73 |
| Median Absolute Deviation (MAD) | 287 |
| Skewness | -0.00760012114 |
| Sum | 1.912255873e+13 |
| Variance | 88061.73411 |
| Value | Count | Frequency (%) | |
| 19122209 | 10988 | 1.1% | |
| 19122210 | 10804 | 1.1% | |
| 19122813 | 10632 | 1.1% | |
| 19122212 | 10151 | 1.0% | |
| 19122814 | 9739 | 1.0% | |
| 19122211 | 9453 | 0.9% | |
| 19123004 | 8529 | 0.9% | |
| 19122809 | 8188 | 0.8% | |
| 19122208 | 7900 | 0.8% | |
| 19122213 | 7850 | 0.8% | |
| 19122808 | 7230 | 0.7% | |
| 19122205 | 7109 | 0.7% | |
| 19122815 | 7066 | 0.7% | |
| 19122206 | 7004 | 0.7% | |
| 19122816 | 6938 | 0.7% | |
| 19122817 | 6893 | 0.7% | |
| 19122304 | 6796 | 0.7% | |
| 19122105 | 6781 | 0.7% | |
| 19122812 | 6647 | 0.7% | |
| 19122104 | 6549 | 0.7% | |
| 19122417 | 6507 | 0.7% | |
| 19123014 | 6492 | 0.6% | |
| 19122811 | 6464 | 0.6% | |
| 19122810 | 6399 | 0.6% | |
| 19123005 | 6182 | 0.6% | |
| Other values (215) | 804709 | 80.5% |
| Value | Count | Frequency (%) | |
| 19122100 | 2935 | 0.3% | |
| 19122101 | 3444 | 0.3% | |
| 19122102 | 5059 | 0.5% | |
| 19122103 | 4852 | 0.5% | |
| 19122104 | 6549 | 0.7% | |
| 19122105 | 6781 | 0.7% | |
| 19122106 | 5888 | 0.6% | |
| 19122107 | 5094 | 0.5% | |
| 19122108 | 5149 | 0.5% | |
| 19122109 | 5658 | 0.6% |
| Value | Count | Frequency (%) | |
| 19123023 | 1954 | 0.2% | |
| 19123022 | 2528 | 0.3% | |
| 19123021 | 2769 | 0.3% | |
| 19123020 | 2743 | 0.3% | |
| 19123019 | 3310 | 0.3% | |
| 19123018 | 3901 | 0.4% | |
| 19123017 | 4401 | 0.4% | |
| 19123016 | 5232 | 0.5% | |
| 19123015 | 5859 | 0.6% | |
| 19123014 | 6492 | 0.6% |
| Distinct count | 2686 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 85f751fd | |
|---|---|
| 1fbe01fe | |
| e151e245 | 65115 |
| d9750ee7 | 23541 |
| 5b08c53b | 22730 |
| Other values (2681) |
| Value | Count | Frequency (%) | |
| 85f751fd | 360971 | 36.1% | |
| 1fbe01fe | 160356 | 16.0% | |
| e151e245 | 65115 | 6.5% | |
| d9750ee7 | 23541 | 2.4% | |
| 5b08c53b | 22730 | 2.3% | |
| 5b4d2eda | 19244 | 1.9% | |
| 856e6d3f | 19120 | 1.9% | |
| a7853007 | 11389 | 1.1% | |
| b7e9786d | 9128 | 0.9% | |
| 5ee41ff2 | 8658 | 0.9% | |
| 6399eda6 | 8657 | 0.9% | |
| 5bcf81a2 | 8286 | 0.8% | |
| 6256f5b4 | 7848 | 0.8% | |
| 57ef2c87 | 7622 | 0.8% | |
| 17caea14 | 6846 | 0.7% | |
| 83a0ad1a | 6686 | 0.7% | |
| 57fe1b20 | 6666 | 0.7% | |
| 0a742914 | 6665 | 0.7% | |
| e4d8dd7b | 6437 | 0.6% | |
| e8f79e60 | 6352 | 0.6% | |
| d6137915 | 5769 | 0.6% | |
| 6c5b482c | 4839 | 0.5% | |
| 12fb4121 | 4682 | 0.5% | |
| 93eaba74 | 4504 | 0.5% | |
| e5c60a05 | 4485 | 0.4% | |
| Other values (2661) | 203404 | 20.3% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| f | 1218699 | 15.2% | |
| 5 | 1134298 | 14.2% | |
| 1 | 994777 | 12.4% | |
| e | 743783 | 9.3% | |
| 7 | 625383 | 7.8% | |
| d | 583646 | 7.3% | |
| 8 | 550753 | 6.9% | |
| b | 380198 | 4.8% | |
| 0 | 354113 | 4.4% | |
| 4 | 251131 | 3.1% | |
| 2 | 241331 | 3.0% | |
| a | 206399 | 2.6% | |
| 6 | 195910 | 2.4% | |
| 9 | 179422 | 2.2% | |
| 3 | 173869 | 2.2% | |
| c | 166288 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 4700987 | 58.8% | |
| Lowercase Letter | 3299013 | 41.2% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 5 | 1134298 | 24.1% | |
| 1 | 994777 | 21.2% | |
| 7 | 625383 | 13.3% | |
| 8 | 550753 | 11.7% | |
| 0 | 354113 | 7.5% | |
| 4 | 251131 | 5.3% | |
| 2 | 241331 | 5.1% | |
| 6 | 195910 | 4.2% | |
| 9 | 179422 | 3.8% | |
| 3 | 173869 | 3.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| f | 1218699 | 36.9% | |
| e | 743783 | 22.5% | |
| d | 583646 | 17.7% | |
| b | 380198 | 11.5% | |
| a | 206399 | 6.3% | |
| c | 166288 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 4700987 | 58.8% | |
| Latin | 3299013 | 41.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 5 | 1134298 | 24.1% | |
| 1 | 994777 | 21.2% | |
| 7 | 625383 | 13.3% | |
| 8 | 550753 | 11.7% | |
| 0 | 354113 | 7.5% | |
| 4 | 251131 | 5.3% | |
| 2 | 241331 | 5.1% | |
| 6 | 195910 | 4.2% | |
| 9 | 179422 | 3.8% | |
| 3 | 173869 | 3.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| f | 1218699 | 36.9% | |
| e | 743783 | 22.5% | |
| d | 583646 | 17.7% | |
| b | 380198 | 11.5% | |
| a | 206399 | 6.3% | |
| c | 166288 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 8000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| f | 1218699 | 15.2% | |
| 5 | 1134298 | 14.2% | |
| 1 | 994777 | 12.4% | |
| e | 743783 | 9.3% | |
| 7 | 625383 | 7.8% | |
| d | 583646 | 7.3% | |
| 8 | 550753 | 6.9% | |
| b | 380198 | 4.8% | |
| 0 | 354113 | 4.4% | |
| 4 | 251131 | 3.1% | |
| 2 | 241331 | 3.0% | |
| a | 206399 | 2.6% | |
| 6 | 195910 | 2.4% | |
| 9 | 179422 | 2.2% | |
| 3 | 173869 | 2.2% | |
| c | 166288 | 2.1% |
| Distinct count | 2882 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| c4e18dd6 | |
|---|---|
| f3845767 | |
| 7e091613 | |
| 7687a86e | 32073 |
| 98572c79 | 24372 |
| Other values (2877) |
| Value | Count | Frequency (%) | |
| c4e18dd6 | 374079 | 37.4% | |
| f3845767 | 160356 | 16.0% | |
| 7e091613 | 82047 | 8.2% | |
| 7687a86e | 32073 | 3.2% | |
| 98572c79 | 24372 | 2.4% | |
| 16a36ef3 | 21382 | 2.1% | |
| 58a89a43 | 19120 | 1.9% | |
| b12b9f85 | 9259 | 0.9% | |
| 9d54950b | 9206 | 0.9% | |
| 17d996e6 | 8774 | 0.9% | |
| 968765cd | 8657 | 0.9% | |
| 28f93029 | 7848 | 0.8% | |
| bd6d812f | 7622 | 0.8% | |
| d262cf1e | 7219 | 0.7% | |
| 0dde25ec | 6846 | 0.7% | |
| 5b626596 | 6690 | 0.7% | |
| 5c9ae867 | 6686 | 0.7% | |
| 510bd839 | 6665 | 0.7% | |
| a17bde68 | 6437 | 0.6% | |
| c4342784 | 6352 | 0.6% | |
| 6b59f079 | 5891 | 0.6% | |
| bb1ef334 | 5769 | 0.6% | |
| 7256c623 | 5461 | 0.5% | |
| a434fa42 | 5049 | 0.5% | |
| 3f2f3819 | 4094 | 0.4% | |
| Other values (2857) | 162046 | 16.2% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| d | 902938 | 11.3% | |
| 6 | 891028 | 11.1% | |
| 8 | 812126 | 10.2% | |
| 1 | 694039 | 8.7% | |
| 4 | 677778 | 8.5% | |
| 7 | 642049 | 8.0% | |
| e | 633709 | 7.9% | |
| c | 537347 | 6.7% | |
| 3 | 439438 | 5.5% | |
| 5 | 364882 | 4.6% | |
| 9 | 337474 | 4.2% | |
| f | 311738 | 3.9% | |
| 2 | 202176 | 2.5% | |
| 0 | 194294 | 2.4% | |
| a | 189747 | 2.4% | |
| b | 169237 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 5255284 | 65.7% | |
| Lowercase Letter | 2744716 | 34.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| d | 902938 | 32.9% | |
| e | 633709 | 23.1% | |
| c | 537347 | 19.6% | |
| f | 311738 | 11.4% | |
| a | 189747 | 6.9% | |
| b | 169237 | 6.2% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 6 | 891028 | 17.0% | |
| 8 | 812126 | 15.5% | |
| 1 | 694039 | 13.2% | |
| 4 | 677778 | 12.9% | |
| 7 | 642049 | 12.2% | |
| 3 | 439438 | 8.4% | |
| 5 | 364882 | 6.9% | |
| 9 | 337474 | 6.4% | |
| 2 | 202176 | 3.8% | |
| 0 | 194294 | 3.7% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 5255284 | 65.7% | |
| Latin | 2744716 | 34.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| d | 902938 | 32.9% | |
| e | 633709 | 23.1% | |
| c | 537347 | 19.6% | |
| f | 311738 | 11.4% | |
| a | 189747 | 6.9% | |
| b | 169237 | 6.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 6 | 891028 | 17.0% | |
| 8 | 812126 | 15.5% | |
| 1 | 694039 | 13.2% | |
| 4 | 677778 | 12.9% | |
| 7 | 642049 | 12.2% | |
| 3 | 439438 | 8.4% | |
| 5 | 364882 | 6.9% | |
| 9 | 337474 | 6.4% | |
| 2 | 202176 | 3.8% | |
| 0 | 194294 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 8000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| d | 902938 | 11.3% | |
| 6 | 891028 | 11.1% | |
| 8 | 812126 | 10.2% | |
| 1 | 694039 | 8.7% | |
| 4 | 677778 | 8.5% | |
| 7 | 642049 | 8.0% | |
| e | 633709 | 7.9% | |
| c | 537347 | 6.7% | |
| 3 | 439438 | 5.5% | |
| 5 | 364882 | 4.6% | |
| 9 | 337474 | 4.2% | |
| f | 311738 | 3.9% | |
| 2 | 202176 | 2.5% | |
| 0 | 194294 | 2.4% | |
| a | 189747 | 2.4% | |
| b | 169237 | 2.1% |
scat
Categorical
| Distinct count | 20 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 50e219e0 | |
|---|---|
| f028772b | |
| 28905ebd | |
| 3e814130 | 75650 |
| f66779e6 | 6288 |
| Other values (15) | 13887 |
| Value | Count | Frequency (%) | |
| 50e219e0 | 409036 | 40.9% | |
| f028772b | 313005 | 31.3% | |
| 28905ebd | 182134 | 18.2% | |
| 3e814130 | 75650 | 7.6% | |
| f66779e6 | 6288 | 0.6% | |
| 75fa27f6 | 4037 | 0.4% | |
| 335d28a8 | 3405 | 0.3% | |
| 76b2941d | 2650 | 0.3% | |
| c0dd3be3 | 1045 | 0.1% | |
| 72722551 | 732 | 0.1% | |
| 70fb0e29 | 624 | 0.1% | |
| dedf689d | 584 | 0.1% | |
| 0569f928 | 394 | < 0.1% | |
| 8fd0aea4 | 206 | < 0.1% | |
| a818d37a | 87 | < 0.1% | |
| 42a36e14 | 61 | < 0.1% | |
| e787de0e | 24 | < 0.1% | |
| bcf865d9 | 21 | < 0.1% | |
| 5378d028 | 14 | < 0.1% | |
| 9ccfa2ea | 3 | < 0.1% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 1391792 | 17.4% | |
| 2 | 1230564 | 15.4% | |
| e | 1084739 | 13.6% | |
| 7 | 651547 | 8.1% | |
| 9 | 602128 | 7.5% | |
| 5 | 600505 | 7.5% | |
| 8 | 579030 | 7.2% | |
| 1 | 563866 | 7.0% | |
| b | 499479 | 6.2% | |
| f | 329199 | 4.1% | |
| d | 192383 | 2.4% | |
| 3 | 160362 | 2.0% | |
| 4 | 78628 | 1.0% | |
| 6 | 26611 | 0.3% | |
| a | 8095 | 0.1% | |
| c | 1072 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 5885033 | 73.6% | |
| Lowercase Letter | 2114967 | 26.4% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 1391792 | 23.6% | |
| 2 | 1230564 | 20.9% | |
| 7 | 651547 | 11.1% | |
| 9 | 602128 | 10.2% | |
| 5 | 600505 | 10.2% | |
| 8 | 579030 | 9.8% | |
| 1 | 563866 | 9.6% | |
| 3 | 160362 | 2.7% | |
| 4 | 78628 | 1.3% | |
| 6 | 26611 | 0.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 1084739 | 51.3% | |
| b | 499479 | 23.6% | |
| f | 329199 | 15.6% | |
| d | 192383 | 9.1% | |
| a | 8095 | 0.4% | |
| c | 1072 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 5885033 | 73.6% | |
| Latin | 2114967 | 26.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 1391792 | 23.6% | |
| 2 | 1230564 | 20.9% | |
| 7 | 651547 | 11.1% | |
| 9 | 602128 | 10.2% | |
| 5 | 600505 | 10.2% | |
| 8 | 579030 | 9.8% | |
| 1 | 563866 | 9.6% | |
| 3 | 160362 | 2.7% | |
| 4 | 78628 | 1.3% | |
| 6 | 26611 | 0.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 1084739 | 51.3% | |
| b | 499479 | 23.6% | |
| f | 329199 | 15.6% | |
| d | 192383 | 9.1% | |
| a | 8095 | 0.4% | |
| c | 1072 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 8000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 1391792 | 17.4% | |
| 2 | 1230564 | 15.4% | |
| e | 1084739 | 13.6% | |
| 7 | 651547 | 8.1% | |
| 9 | 602128 | 7.5% | |
| 5 | 600505 | 7.5% | |
| 8 | 579030 | 7.2% | |
| 1 | 563866 | 7.0% | |
| b | 499479 | 6.2% | |
| f | 329199 | 4.1% | |
| d | 192383 | 2.4% | |
| 3 | 160362 | 2.0% | |
| 4 | 78628 | 1.0% | |
| 6 | 26611 | 0.3% | |
| a | 8095 | 0.1% | |
| c | 1072 | < 0.1% |
| Distinct count | 3149 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| ecad2386 | |
|---|---|
| 92f5800b | 38510 |
| e2fcccd2 | 27831 |
| febd1138 | 18987 |
| 9c13b419 | 18782 |
| Other values (3144) |
| Value | Count | Frequency (%) | |
| ecad2386 | 639029 | 63.9% | |
| 92f5800b | 38510 | 3.9% | |
| e2fcccd2 | 27831 | 2.8% | |
| febd1138 | 18987 | 1.9% | |
| 9c13b419 | 18782 | 1.9% | |
| 7358e05e | 15125 | 1.5% | |
| a5184c22 | 11988 | 1.2% | |
| d36838b1 | 11299 | 1.1% | |
| 685d1c4c | 10085 | 1.0% | |
| 54c5d545 | 9924 | 1.0% | |
| 03528b27 | 7950 | 0.8% | |
| f0d41ff1 | 7219 | 0.7% | |
| e2a1ca37 | 6917 | 0.7% | |
| e9739828 | 6900 | 0.7% | |
| 51cedd4e | 5966 | 0.6% | |
| 66f5e02e | 5704 | 0.6% | |
| 03a08c3f | 5303 | 0.5% | |
| 98fed791 | 5286 | 0.5% | |
| 73206397 | 4961 | 0.5% | |
| f53417e1 | 4879 | 0.5% | |
| e96773f0 | 4403 | 0.4% | |
| ce183bbd | 3706 | 0.4% | |
| be7c618d | 2946 | 0.3% | |
| f888bf4c | 2588 | 0.3% | |
| 1dc72b4d | 2539 | 0.3% | |
| Other values (3124) | 121173 | 12.1% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| c | 871409 | 10.9% | |
| 8 | 861392 | 10.8% | |
| 2 | 854331 | 10.7% | |
| 3 | 832322 | 10.4% | |
| e | 832308 | 10.4% | |
| d | 818994 | 10.2% | |
| 6 | 743151 | 9.3% | |
| a | 737151 | 9.2% | |
| f | 214362 | 2.7% | |
| 5 | 209596 | 2.6% | |
| 1 | 202423 | 2.5% | |
| 0 | 190238 | 2.4% | |
| 9 | 173226 | 2.2% | |
| b | 170083 | 2.1% | |
| 4 | 153841 | 1.9% | |
| 7 | 135173 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 4355693 | 54.4% | |
| Lowercase Letter | 3644307 | 45.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| c | 871409 | 23.9% | |
| e | 832308 | 22.8% | |
| d | 818994 | 22.5% | |
| a | 737151 | 20.2% | |
| f | 214362 | 5.9% | |
| b | 170083 | 4.7% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 8 | 861392 | 19.8% | |
| 2 | 854331 | 19.6% | |
| 3 | 832322 | 19.1% | |
| 6 | 743151 | 17.1% | |
| 5 | 209596 | 4.8% | |
| 1 | 202423 | 4.6% | |
| 0 | 190238 | 4.4% | |
| 9 | 173226 | 4.0% | |
| 4 | 153841 | 3.5% | |
| 7 | 135173 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 4355693 | 54.4% | |
| Latin | 3644307 | 45.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| c | 871409 | 23.9% | |
| e | 832308 | 22.8% | |
| d | 818994 | 22.5% | |
| a | 737151 | 20.2% | |
| f | 214362 | 5.9% | |
| b | 170083 | 4.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 8 | 861392 | 19.8% | |
| 2 | 854331 | 19.6% | |
| 3 | 832322 | 19.1% | |
| 6 | 743151 | 17.1% | |
| 5 | 209596 | 4.8% | |
| 1 | 202423 | 4.6% | |
| 0 | 190238 | 4.4% | |
| 9 | 173226 | 4.0% | |
| 4 | 153841 | 3.5% | |
| 7 | 135173 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 8000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| c | 871409 | 10.9% | |
| 8 | 861392 | 10.8% | |
| 2 | 854331 | 10.7% | |
| 3 | 832322 | 10.4% | |
| e | 832308 | 10.4% | |
| d | 818994 | 10.2% | |
| 6 | 743151 | 9.3% | |
| a | 737151 | 9.2% | |
| f | 214362 | 2.7% | |
| 5 | 209596 | 2.6% | |
| 1 | 202423 | 2.5% | |
| 0 | 190238 | 2.4% | |
| 9 | 173226 | 2.2% | |
| b | 170083 | 2.1% | |
| 4 | 153841 | 1.9% | |
| 7 | 135173 | 1.7% |
| Distinct count | 199 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 7801e8d9 | |
|---|---|
| 2347f47a | 129463 |
| ae637522 | 46658 |
| 5c5a694b | 27835 |
| 82e27996 | 18988 |
| Other values (194) | 103271 |
| Value | Count | Frequency (%) | |
| 7801e8d9 | 673785 | 67.4% | |
| 2347f47a | 129463 | 12.9% | |
| ae637522 | 46658 | 4.7% | |
| 5c5a694b | 27835 | 2.8% | |
| 82e27996 | 18988 | 1.9% | |
| d9b5648e | 17663 | 1.8% | |
| 0e8616ad | 16272 | 1.6% | |
| b9528b13 | 15892 | 1.6% | |
| b8d325c3 | 12991 | 1.3% | |
| aefc06bd | 7463 | 0.7% | |
| df32afa9 | 7070 | 0.7% | |
| 33da2e74 | 6422 | 0.6% | |
| 6f7ca2ba | 5705 | 0.6% | |
| 5b9c592b | 2588 | 0.3% | |
| 885c7f3f | 1731 | 0.2% | |
| 5c620f04 | 1508 | 0.2% | |
| 45a51db4 | 1425 | 0.1% | |
| b5f3b24a | 1173 | 0.1% | |
| 813f3323 | 598 | 0.1% | |
| 0654b444 | 597 | 0.1% | |
| ad63ec9b | 433 | < 0.1% | |
| c6824def | 387 | < 0.1% | |
| a8b0bf20 | 343 | < 0.1% | |
| 15ec7f39 | 319 | < 0.1% | |
| 99b4c806 | 269 | < 0.1% | |
| Other values (174) | 2422 | 0.2% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 8 | 1435706 | 17.9% | |
| 7 | 1013967 | 12.7% | |
| e | 789966 | 9.9% | |
| 9 | 787536 | 9.8% | |
| d | 745088 | 9.3% | |
| 1 | 709224 | 8.9% | |
| 0 | 702901 | 8.8% | |
| 4 | 321174 | 4.0% | |
| 2 | 316690 | 4.0% | |
| a | 264336 | 3.3% | |
| 3 | 245459 | 3.1% | |
| f | 166225 | 2.1% | |
| 5 | 163178 | 2.0% | |
| 6 | 161064 | 2.0% | |
| b | 115338 | 1.4% | |
| c | 62148 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 5856899 | 73.2% | |
| Lowercase Letter | 2143101 | 26.8% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 789966 | 36.9% | |
| d | 745088 | 34.8% | |
| a | 264336 | 12.3% | |
| f | 166225 | 7.8% | |
| b | 115338 | 5.4% | |
| c | 62148 | 2.9% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 8 | 1435706 | 24.5% | |
| 7 | 1013967 | 17.3% | |
| 9 | 787536 | 13.4% | |
| 1 | 709224 | 12.1% | |
| 0 | 702901 | 12.0% | |
| 4 | 321174 | 5.5% | |
| 2 | 316690 | 5.4% | |
| 3 | 245459 | 4.2% | |
| 5 | 163178 | 2.8% | |
| 6 | 161064 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 5856899 | 73.2% | |
| Latin | 2143101 | 26.8% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 789966 | 36.9% | |
| d | 745088 | 34.8% | |
| a | 264336 | 12.3% | |
| f | 166225 | 7.8% | |
| b | 115338 | 5.4% | |
| c | 62148 | 2.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 8 | 1435706 | 24.5% | |
| 7 | 1013967 | 17.3% | |
| 9 | 787536 | 13.4% | |
| 1 | 709224 | 12.1% | |
| 0 | 702901 | 12.0% | |
| 4 | 321174 | 5.5% | |
| 2 | 316690 | 5.4% | |
| 3 | 245459 | 4.2% | |
| 5 | 163178 | 2.8% | |
| 6 | 161064 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 8000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 8 | 1435706 | 17.9% | |
| 7 | 1013967 | 12.7% | |
| e | 789966 | 9.9% | |
| 9 | 787536 | 9.8% | |
| d | 745088 | 9.3% | |
| 1 | 709224 | 8.9% | |
| 0 | 702901 | 8.8% | |
| 4 | 321174 | 4.0% | |
| 2 | 316690 | 4.0% | |
| a | 264336 | 3.3% | |
| 3 | 245459 | 3.1% | |
| f | 166225 | 2.1% | |
| 5 | 163178 | 2.0% | |
| 6 | 161064 | 2.0% | |
| b | 115338 | 1.4% | |
| c | 62148 | 0.8% |
acat
Categorical
| Distinct count | 26 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 07d7df22 | |
|---|---|
| 0f2161f8 | |
| cef3e649 | 42540 |
| 8ded1f7a | 36074 |
| f95efa07 | 28188 |
| Other values (21) | 9098 |
| Value | Count | Frequency (%) | |
| 07d7df22 | 647371 | 64.7% | |
| 0f2161f8 | 236729 | 23.7% | |
| cef3e649 | 42540 | 4.3% | |
| 8ded1f7a | 36074 | 3.6% | |
| f95efa07 | 28188 | 2.8% | |
| d1327cf5 | 3083 | 0.3% | |
| dc97ec06 | 1429 | 0.1% | |
| 09481d60 | 1390 | 0.1% | |
| 75d80bbe | 1019 | 0.1% | |
| fc6fa53d | 586 | 0.1% | |
| 4ce2e9fc | 496 | < 0.1% | |
| 879c24eb | 325 | < 0.1% | |
| a3c42688 | 278 | < 0.1% | |
| 4681bb9d | 172 | < 0.1% | |
| 0f9a328c | 111 | < 0.1% | |
| 2281a340 | 54 | < 0.1% | |
| a86a3e89 | 53 | < 0.1% | |
| 8df2e842 | 45 | < 0.1% | |
| 79f0b860 | 16 | < 0.1% | |
| a7fd01ec | 10 | < 0.1% | |
| 2fc4f2aa | 9 | < 0.1% | |
| 7113d72a | 8 | < 0.1% | |
| 18b1e0be | 6 | < 0.1% | |
| 0bfbc358 | 5 | < 0.1% | |
| 5326cf99 | 2 | < 0.1% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 1535992 | 19.2% | |
| d | 1374634 | 17.2% | |
| 7 | 1364902 | 17.1% | |
| f | 1260777 | 15.8% | |
| 0 | 917735 | 11.5% | |
| 1 | 514269 | 6.4% | |
| 6 | 283195 | 3.5% | |
| 8 | 276654 | 3.5% | |
| e | 153227 | 1.9% | |
| 9 | 74724 | 0.9% | |
| a | 65433 | 0.8% | |
| c | 50799 | 0.6% | |
| 3 | 46720 | 0.6% | |
| 4 | 45309 | 0.6% | |
| 5 | 32884 | 0.4% | |
| b | 2746 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 5092384 | 63.7% | |
| Lowercase Letter | 2907616 | 36.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 1535992 | 30.2% | |
| 7 | 1364902 | 26.8% | |
| 0 | 917735 | 18.0% | |
| 1 | 514269 | 10.1% | |
| 6 | 283195 | 5.6% | |
| 8 | 276654 | 5.4% | |
| 9 | 74724 | 1.5% | |
| 3 | 46720 | 0.9% | |
| 4 | 45309 | 0.9% | |
| 5 | 32884 | 0.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| d | 1374634 | 47.3% | |
| f | 1260777 | 43.4% | |
| e | 153227 | 5.3% | |
| a | 65433 | 2.3% | |
| c | 50799 | 1.7% | |
| b | 2746 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 5092384 | 63.7% | |
| Latin | 2907616 | 36.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 1535992 | 30.2% | |
| 7 | 1364902 | 26.8% | |
| 0 | 917735 | 18.0% | |
| 1 | 514269 | 10.1% | |
| 6 | 283195 | 5.6% | |
| 8 | 276654 | 5.4% | |
| 9 | 74724 | 1.5% | |
| 3 | 46720 | 0.9% | |
| 4 | 45309 | 0.9% | |
| 5 | 32884 | 0.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| d | 1374634 | 47.3% | |
| f | 1260777 | 43.4% | |
| e | 153227 | 5.3% | |
| a | 65433 | 2.3% | |
| c | 50799 | 1.7% | |
| b | 2746 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 8000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 1535992 | 19.2% | |
| d | 1374634 | 17.2% | |
| 7 | 1364902 | 17.1% | |
| f | 1260777 | 15.8% | |
| 0 | 917735 | 11.5% | |
| 1 | 514269 | 6.4% | |
| 6 | 283195 | 3.5% | |
| 8 | 276654 | 3.5% | |
| e | 153227 | 1.9% | |
| 9 | 74724 | 0.9% | |
| a | 65433 | 0.8% | |
| c | 50799 | 0.6% | |
| 3 | 46720 | 0.6% | |
| 4 | 45309 | 0.6% | |
| 5 | 32884 | 0.4% | |
| b | 2746 | < 0.1% |
| Distinct count | 150266 |
|---|---|
| Unique (%) | 15.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| a99f214a | |
|---|---|
| 0f7c61dc | 508 |
| c357dbff | 467 |
| 936e92fb | 353 |
| afeffc18 | 234 |
| Other values (150261) |
| Value | Count | Frequency (%) | |
| a99f214a | 825245 | 82.5% | |
| 0f7c61dc | 508 | 0.1% | |
| c357dbff | 467 | < 0.1% | |
| 936e92fb | 353 | < 0.1% | |
| afeffc18 | 234 | < 0.1% | |
| 28dc8687 | 121 | < 0.1% | |
| 987552d1 | 109 | < 0.1% | |
| cef4c8cc | 106 | < 0.1% | |
| d857ffbb | 99 | < 0.1% | |
| 3cdb4052 | 94 | < 0.1% | |
| b09da1c4 | 90 | < 0.1% | |
| 03559b29 | 66 | < 0.1% | |
| 02da5312 | 57 | < 0.1% | |
| 096a6f32 | 41 | < 0.1% | |
| bbcf14e4 | 39 | < 0.1% | |
| d2e4c0ab | 38 | < 0.1% | |
| f1d9c744 | 37 | < 0.1% | |
| eec6d022 | 34 | < 0.1% | |
| c35f5168 | 33 | < 0.1% | |
| e8343327 | 30 | < 0.1% | |
| 9af87478 | 28 | < 0.1% | |
| f58a1c3b | 26 | < 0.1% | |
| 0a04637d | 26 | < 0.1% | |
| 4e9e9550 | 25 | < 0.1% | |
| abab24a7 | 25 | < 0.1% | |
| Other values (150241) | 172069 | 17.2% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 9 | 1737292 | 21.7% | |
| a | 1736892 | 21.7% | |
| f | 914313 | 11.4% | |
| 4 | 912446 | 11.4% | |
| 2 | 912352 | 11.4% | |
| 1 | 912249 | 11.4% | |
| c | 88427 | 1.1% | |
| 3 | 87808 | 1.1% | |
| 7 | 87758 | 1.1% | |
| 5 | 87700 | 1.1% | |
| e | 87490 | 1.1% | |
| d | 87477 | 1.1% | |
| b | 87464 | 1.1% | |
| 6 | 86993 | 1.1% | |
| 0 | 86932 | 1.1% | |
| 8 | 86407 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 4997937 | 62.5% | |
| Lowercase Letter | 3002063 | 37.5% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 9 | 1737292 | 34.8% | |
| 4 | 912446 | 18.3% | |
| 2 | 912352 | 18.3% | |
| 1 | 912249 | 18.3% | |
| 3 | 87808 | 1.8% | |
| 7 | 87758 | 1.8% | |
| 5 | 87700 | 1.8% | |
| 6 | 86993 | 1.7% | |
| 0 | 86932 | 1.7% | |
| 8 | 86407 | 1.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 1736892 | 57.9% | |
| f | 914313 | 30.5% | |
| c | 88427 | 2.9% | |
| e | 87490 | 2.9% | |
| d | 87477 | 2.9% | |
| b | 87464 | 2.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 4997937 | 62.5% | |
| Latin | 3002063 | 37.5% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 9 | 1737292 | 34.8% | |
| 4 | 912446 | 18.3% | |
| 2 | 912352 | 18.3% | |
| 1 | 912249 | 18.3% | |
| 3 | 87808 | 1.8% | |
| 7 | 87758 | 1.8% | |
| 5 | 87700 | 1.8% | |
| 6 | 86993 | 1.7% | |
| 0 | 86932 | 1.7% | |
| 8 | 86407 | 1.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 1736892 | 57.9% | |
| f | 914313 | 30.5% | |
| c | 88427 | 2.9% | |
| e | 87490 | 2.9% | |
| d | 87477 | 2.9% | |
| b | 87464 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 8000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 9 | 1737292 | 21.7% | |
| a | 1736892 | 21.7% | |
| f | 914313 | 11.4% | |
| 4 | 912446 | 11.4% | |
| 2 | 912352 | 11.4% | |
| 1 | 912249 | 11.4% | |
| c | 88427 | 1.1% | |
| 3 | 87808 | 1.1% | |
| 7 | 87758 | 1.1% | |
| 5 | 87700 | 1.1% | |
| e | 87490 | 1.1% | |
| d | 87477 | 1.1% | |
| b | 87464 | 1.1% | |
| 6 | 86993 | 1.1% | |
| 0 | 86932 | 1.1% | |
| 8 | 86407 | 1.1% |
| Distinct count | 555865 |
|---|---|
| Unique (%) | 55.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 6b9769f2 | 5085 |
|---|---|
| 431b3174 | 3463 |
| 2f323f36 | 2198 |
| 930ec31d | 2166 |
| af9205f9 | 2163 |
| Other values (555860) |
| Value | Count | Frequency (%) | |
| 6b9769f2 | 5085 | 0.5% | |
| 431b3174 | 3463 | 0.3% | |
| 2f323f36 | 2198 | 0.2% | |
| 930ec31d | 2166 | 0.2% | |
| af9205f9 | 2163 | 0.2% | |
| d90a7774 | 2105 | 0.2% | |
| 6394f6f6 | 2088 | 0.2% | |
| af62faf4 | 2083 | 0.2% | |
| 009a7861 | 2030 | 0.2% | |
| 285aa37d | 2012 | 0.2% | |
| c6563308 | 1793 | 0.2% | |
| 0489ce3f | 1756 | 0.2% | |
| ddd2926e | 1754 | 0.2% | |
| a8536f3a | 1731 | 0.2% | |
| ceffea69 | 1727 | 0.2% | |
| 488a9a3e | 1719 | 0.2% | |
| 1cf29716 | 1716 | 0.2% | |
| 8a014cbb | 1688 | 0.2% | |
| 57cd4006 | 1684 | 0.2% | |
| 75bb1b58 | 1671 | 0.2% | |
| 9b1fe278 | 1591 | 0.2% | |
| 07875ea4 | 933 | 0.1% | |
| 7ed30f6c | 921 | 0.1% | |
| b0070d9a | 895 | 0.1% | |
| ac77b71a | 868 | 0.1% | |
| Other values (555840) | 952160 | 95.2% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| f | 518304 | 6.5% | |
| 3 | 512224 | 6.4% | |
| 9 | 512135 | 6.4% | |
| 6 | 511528 | 6.4% | |
| a | 507854 | 6.3% | |
| 7 | 502564 | 6.3% | |
| 4 | 499049 | 6.2% | |
| 0 | 498232 | 6.2% | |
| 1 | 497414 | 6.2% | |
| 2 | 496392 | 6.2% | |
| b | 495025 | 6.2% | |
| 8 | 491606 | 6.1% | |
| d | 491475 | 6.1% | |
| e | 491376 | 6.1% | |
| c | 488901 | 6.1% | |
| 5 | 485921 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 5007065 | 62.6% | |
| Lowercase Letter | 2992935 | 37.4% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 3 | 512224 | 10.2% | |
| 9 | 512135 | 10.2% | |
| 6 | 511528 | 10.2% | |
| 7 | 502564 | 10.0% | |
| 4 | 499049 | 10.0% | |
| 0 | 498232 | 10.0% | |
| 1 | 497414 | 9.9% | |
| 2 | 496392 | 9.9% | |
| 8 | 491606 | 9.8% | |
| 5 | 485921 | 9.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| f | 518304 | 17.3% | |
| a | 507854 | 17.0% | |
| b | 495025 | 16.5% | |
| d | 491475 | 16.4% | |
| e | 491376 | 16.4% | |
| c | 488901 | 16.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 5007065 | 62.6% | |
| Latin | 2992935 | 37.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 3 | 512224 | 10.2% | |
| 9 | 512135 | 10.2% | |
| 6 | 511528 | 10.2% | |
| 7 | 502564 | 10.0% | |
| 4 | 499049 | 10.0% | |
| 0 | 498232 | 10.0% | |
| 1 | 497414 | 9.9% | |
| 2 | 496392 | 9.9% | |
| 8 | 491606 | 9.8% | |
| 5 | 485921 | 9.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| f | 518304 | 17.3% | |
| a | 507854 | 17.0% | |
| b | 495025 | 16.5% | |
| d | 491475 | 16.4% | |
| e | 491376 | 16.4% | |
| c | 488901 | 16.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 8000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| f | 518304 | 6.5% | |
| 3 | 512224 | 6.4% | |
| 9 | 512135 | 6.4% | |
| 6 | 511528 | 6.4% | |
| a | 507854 | 6.3% | |
| 7 | 502564 | 6.3% | |
| 4 | 499049 | 6.2% | |
| 0 | 498232 | 6.2% | |
| 1 | 497414 | 6.2% | |
| 2 | 496392 | 6.2% | |
| b | 495025 | 6.2% | |
| 8 | 491606 | 6.1% | |
| d | 491475 | 6.1% | |
| e | 491376 | 6.1% | |
| c | 488901 | 6.1% | |
| 5 | 485921 | 6.1% |
| Distinct count | 5150 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 8a4875bd | 60497 |
|---|---|
| 1f0bc64f | 35049 |
| d787e91b | 34630 |
| 76dc4769 | 18999 |
| be6db1d7 | 18212 |
| Other values (5145) |
| Value | Count | Frequency (%) | |
| 8a4875bd | 60497 | 6.0% | |
| 1f0bc64f | 35049 | 3.5% | |
| d787e91b | 34630 | 3.5% | |
| 76dc4769 | 18999 | 1.9% | |
| be6db1d7 | 18212 | 1.8% | |
| a0f5f879 | 16126 | 1.6% | |
| 4ea23a13 | 16043 | 1.6% | |
| 7abbbd5c | 15640 | 1.6% | |
| ecb851b2 | 15097 | 1.5% | |
| d4897fef | 11967 | 1.2% | |
| 5096d134 | 11739 | 1.2% | |
| 711ee120 | 11036 | 1.1% | |
| 1ccc7835 | 10544 | 1.1% | |
| e1eae715 | 10384 | 1.0% | |
| c6263d8a | 9752 | 1.0% | |
| 84ebbcd4 | 9536 | 1.0% | |
| be74e6fe | 9440 | 0.9% | |
| 3bd9e8e7 | 8914 | 0.9% | |
| 0eb711ec | 8889 | 0.9% | |
| 0bcabeaf | 8881 | 0.9% | |
| f07e20f8 | 8874 | 0.9% | |
| 3bb1ddd7 | 8871 | 0.9% | |
| 981edffc | 8809 | 0.9% | |
| 779d90c2 | 8697 | 0.9% | |
| 36b67a2a | 8585 | 0.9% | |
| Other values (5125) | 614789 | 61.5% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| b | 644633 | 8.1% | |
| 7 | 603566 | 7.5% | |
| e | 590214 | 7.4% | |
| d | 565036 | 7.1% | |
| 8 | 549322 | 6.9% | |
| 1 | 544798 | 6.8% | |
| 4 | 527628 | 6.6% | |
| a | 496127 | 6.2% | |
| f | 480601 | 6.0% | |
| 6 | 471916 | 5.9% | |
| 5 | 462459 | 5.8% | |
| c | 458578 | 5.7% | |
| 9 | 439134 | 5.5% | |
| 0 | 395510 | 4.9% | |
| 2 | 385375 | 4.8% | |
| 3 | 385103 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 4764811 | 59.6% | |
| Lowercase Letter | 3235189 | 40.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| b | 644633 | 19.9% | |
| e | 590214 | 18.2% | |
| d | 565036 | 17.5% | |
| a | 496127 | 15.3% | |
| f | 480601 | 14.9% | |
| c | 458578 | 14.2% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 7 | 603566 | 12.7% | |
| 8 | 549322 | 11.5% | |
| 1 | 544798 | 11.4% | |
| 4 | 527628 | 11.1% | |
| 6 | 471916 | 9.9% | |
| 5 | 462459 | 9.7% | |
| 9 | 439134 | 9.2% | |
| 0 | 395510 | 8.3% | |
| 2 | 385375 | 8.1% | |
| 3 | 385103 | 8.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 4764811 | 59.6% | |
| Latin | 3235189 | 40.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| b | 644633 | 19.9% | |
| e | 590214 | 18.2% | |
| d | 565036 | 17.5% | |
| a | 496127 | 15.3% | |
| f | 480601 | 14.9% | |
| c | 458578 | 14.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 7 | 603566 | 12.7% | |
| 8 | 549322 | 11.5% | |
| 1 | 544798 | 11.4% | |
| 4 | 527628 | 11.1% | |
| 6 | 471916 | 9.9% | |
| 5 | 462459 | 9.7% | |
| 9 | 439134 | 9.2% | |
| 0 | 395510 | 8.3% | |
| 2 | 385375 | 8.1% | |
| 3 | 385103 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 8000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| b | 644633 | 8.1% | |
| 7 | 603566 | 7.5% | |
| e | 590214 | 7.4% | |
| d | 565036 | 7.1% | |
| 8 | 549322 | 6.9% | |
| 1 | 544798 | 6.8% | |
| 4 | 527628 | 6.6% | |
| a | 496127 | 6.2% | |
| f | 480601 | 6.0% | |
| 6 | 471916 | 5.9% | |
| 5 | 462459 | 5.8% | |
| c | 458578 | 5.7% | |
| 9 | 439134 | 5.5% | |
| 0 | 395510 | 4.9% | |
| 2 | 385375 | 4.8% | |
| 3 | 385103 | 4.8% |
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.014972 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 55098 |
| Zeros (%) | 5.5% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5272325077 |
|---|---|
| Coefficient of variation (CV) | 0.5194552241 |
| Kurtosis | 27.85927964 |
| Mean | 1.014972 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.457522816 |
| Sum | 1014972 |
| Variance | 0.2779741172 |
| Value | Count | Frequency (%) | |
| 1 | 922619 | 92.3% | |
| 0 | 55098 | 5.5% | |
| 4 | 19059 | 1.9% | |
| 5 | 3223 | 0.3% | |
| 2 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 55098 | 5.5% | |
| 1 | 922619 | 92.3% | |
| 2 | 1 | < 0.1% | |
| 4 | 19059 | 1.9% | |
| 5 | 3223 | 0.3% |
| Value | Count | Frequency (%) | |
| 5 | 3223 | 0.3% | |
| 4 | 19059 | 1.9% | |
| 2 | 1 | < 0.1% | |
| 1 | 922619 | 92.3% | |
| 0 | 55098 | 5.5% |
dconn
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 0 | |
|---|---|
| 2 | 81622 |
| 3 | 53894 |
| 5 | 1064 |
| Value | Count | Frequency (%) | |
| 0 | 863420 | 86.3% | |
| 2 | 81622 | 8.2% | |
| 3 | 53894 | 5.4% | |
| 5 | 1064 | 0.1% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 863420 | 86.3% | |
| 2 | 81622 | 8.2% | |
| 3 | 53894 | 5.4% | |
| 5 | 1064 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 1000000 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 863420 | 86.3% | |
| 2 | 81622 | 8.2% | |
| 3 | 53894 | 5.4% | |
| 5 | 1064 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 1000000 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 863420 | 86.3% | |
| 2 | 81622 | 8.2% | |
| 3 | 53894 | 5.4% | |
| 5 | 1064 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 863420 | 86.3% | |
| 2 | 81622 | 8.2% | |
| 3 | 53894 | 5.4% | |
| 5 | 1064 | 0.1% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.287912 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 719953 |
| Zeros (%) | 72.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5051998966 |
|---|---|
| Coefficient of variation (CV) | 1.754702467 |
| Kurtosis | 32.64191603 |
| Mean | 0.287912 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.322202321 |
| Sum | 287912 |
| Variance | 0.2552269355 |
| Value | Count | Frequency (%) | |
| 0 | 719953 | 72.0% | |
| 1 | 278289 | 27.8% | |
| 7 | 1051 | 0.1% | |
| 2 | 334 | < 0.1% | |
| 4 | 193 | < 0.1% | |
| 5 | 143 | < 0.1% | |
| 3 | 37 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 719953 | 72.0% | |
| 1 | 278289 | 27.8% | |
| 2 | 334 | < 0.1% | |
| 3 | 37 | < 0.1% | |
| 4 | 193 | < 0.1% | |
| 5 | 143 | < 0.1% | |
| 7 | 1051 | 0.1% |
| Value | Count | Frequency (%) | |
| 7 | 1051 | 0.1% | |
| 5 | 143 | < 0.1% | |
| 4 | 193 | < 0.1% | |
| 3 | 37 | < 0.1% | |
| 2 | 334 | < 0.1% | |
| 1 | 278289 | 27.8% | |
| 0 | 719953 | 72.0% |
A
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1004.966775 |
|---|---|
| Minimum | 1001 |
| Maximum | 1012 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 1002 |
| Q1 | 1005 |
| median | 1005 |
| Q3 | 1005 |
| 95-th percentile | 1005 |
| Maximum | 1012 |
| Range | 11 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.093227467 |
|---|---|
| Coefficient of variation (CV) | 0.001087824487 |
| Kurtosis | 14.78001598 |
| Mean | 1004.966775 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.806447956 |
| Sum | 1004966775 |
| Variance | 1.195146295 |
| Value | Count | Frequency (%) | |
| 1005 | 918627 | 91.9% | |
| 1002 | 55098 | 5.5% | |
| 1010 | 22282 | 2.2% | |
| 1012 | 2758 | 0.3% | |
| 1007 | 882 | 0.1% | |
| 1001 | 210 | < 0.1% | |
| 1008 | 143 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1001 | 210 | < 0.1% | |
| 1002 | 55098 | 5.5% | |
| 1005 | 918627 | 91.9% | |
| 1007 | 882 | 0.1% | |
| 1008 | 143 | < 0.1% | |
| 1010 | 22282 | 2.2% | |
| 1012 | 2758 | 0.3% |
| Value | Count | Frequency (%) | |
| 1012 | 2758 | 0.3% | |
| 1010 | 22282 | 2.2% | |
| 1008 | 143 | < 0.1% | |
| 1007 | 882 | 0.1% | |
| 1005 | 918627 | 91.9% | |
| 1002 | 55098 | 5.5% | |
| 1001 | 210 | < 0.1% |
| Distinct count | 2246 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18848.445596 |
|---|---|
| Minimum | 375 |
| Maximum | 24044 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 375 |
|---|---|
| 5-th percentile | 6393 |
| Q1 | 16920 |
| median | 20346 |
| Q3 | 21894 |
| 95-th percentile | 23561 |
| Maximum | 24044 |
| Range | 23669 |
| Interquartile range (IQR) | 4974 |
Descriptive statistics
| Standard deviation | 4951.645567 |
|---|---|
| Coefficient of variation (CV) | 0.2627084309 |
| Kurtosis | 3.482762077 |
| Mean | 18848.4456 |
| Median Absolute Deviation (MAD) | 2336 |
| Skewness | -1.887985203 |
| Sum | 1.88484456e+10 |
| Variance | 24518793.82 |
| Value | Count | Frequency (%) | |
| 4687 | 23384 | 2.3% | |
| 21611 | 22681 | 2.3% | |
| 21189 | 18980 | 1.9% | |
| 21191 | 18954 | 1.9% | |
| 19771 | 18281 | 1.8% | |
| 19772 | 18064 | 1.8% | |
| 16208 | 16425 | 1.6% | |
| 20108 | 14410 | 1.4% | |
| 8330 | 13735 | 1.4% | |
| 19950 | 13080 | 1.3% | |
| 15701 | 12781 | 1.3% | |
| 15703 | 12686 | 1.3% | |
| 15705 | 12520 | 1.3% | |
| 15707 | 12387 | 1.2% | |
| 15699 | 12363 | 1.2% | |
| 15708 | 12304 | 1.2% | |
| 15704 | 12194 | 1.2% | |
| 15702 | 11907 | 1.2% | |
| 15706 | 11707 | 1.2% | |
| 16615 | 10903 | 1.1% | |
| 23804 | 10166 | 1.0% | |
| 21767 | 9279 | 0.9% | |
| 21768 | 9081 | 0.9% | |
| 22676 | 8552 | 0.9% | |
| 17239 | 8359 | 0.8% | |
| Other values (2221) | 654817 | 65.5% |
| Value | Count | Frequency (%) | |
| 375 | 2065 | 0.2% | |
| 376 | 5 | < 0.1% | |
| 377 | 1987 | 0.2% | |
| 380 | 1809 | 0.2% | |
| 381 | 94 | < 0.1% | |
| 451 | 22 | < 0.1% | |
| 452 | 1451 | 0.1% | |
| 454 | 1453 | 0.1% | |
| 455 | 1 | < 0.1% | |
| 456 | 1505 | 0.2% |
| Value | Count | Frequency (%) | |
| 24044 | 2 | < 0.1% | |
| 24043 | 41 | < 0.1% | |
| 24042 | 22 | < 0.1% | |
| 24041 | 132 | < 0.1% | |
| 24040 | 131 | < 0.1% | |
| 24037 | 47 | < 0.1% | |
| 24036 | 616 | 0.1% | |
| 24035 | 624 | 0.1% | |
| 24034 | 1523 | 0.2% | |
| 24033 | 101 | < 0.1% |
C
Real number (ℝ≥0)
| Distinct count | 8 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 318.892852 |
|---|---|
| Minimum | 120 |
| Maximum | 1024 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 120 |
|---|---|
| 5-th percentile | 300 |
| Q1 | 320 |
| median | 320 |
| Q3 | 320 |
| 95-th percentile | 320 |
| Maximum | 1024 |
| Range | 904 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 21.31636755 |
|---|---|
| Coefficient of variation (CV) | 0.06684492117 |
| Kurtosis | 342.6409646 |
| Mean | 318.892852 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 14.99252409 |
| Sum | 318892852 |
| Variance | 454.3875257 |
| Value | Count | Frequency (%) | |
| 320 | 932772 | 93.3% | |
| 300 | 57835 | 5.8% | |
| 216 | 7309 | 0.7% | |
| 728 | 1851 | 0.2% | |
| 120 | 82 | < 0.1% | |
| 1024 | 70 | < 0.1% | |
| 480 | 51 | < 0.1% | |
| 768 | 30 | < 0.1% |
| Value | Count | Frequency (%) | |
| 120 | 82 | < 0.1% | |
| 216 | 7309 | 0.7% | |
| 300 | 57835 | 5.8% | |
| 320 | 932772 | 93.3% | |
| 480 | 51 | < 0.1% | |
| 728 | 1851 | 0.2% | |
| 768 | 30 | < 0.1% | |
| 1024 | 70 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1024 | 70 | < 0.1% | |
| 768 | 30 | < 0.1% | |
| 728 | 1851 | 0.2% | |
| 480 | 51 | < 0.1% | |
| 320 | 932772 | 93.3% | |
| 300 | 57835 | 5.8% | |
| 216 | 7309 | 0.7% | |
| 120 | 82 | < 0.1% |
D
Real number (ℝ≥0)
| Distinct count | 9 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60.096674 |
|---|---|
| Minimum | 20 |
| Maximum | 1024 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 50 |
| Q1 | 50 |
| median | 50 |
| Q3 | 50 |
| 95-th percentile | 50 |
| Maximum | 1024 |
| Range | 1004 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 47.20946226 |
|---|---|
| Coefficient of variation (CV) | 0.7855586527 |
| Kurtosis | 33.40002178 |
| Mean | 60.096674 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.186969055 |
| Sum | 60096674 |
| Variance | 2228.733327 |
| Value | Count | Frequency (%) | |
| 50 | 943356 | 94.3% | |
| 250 | 44712 | 4.5% | |
| 36 | 7309 | 0.7% | |
| 480 | 2539 | 0.3% | |
| 90 | 1851 | 0.2% | |
| 20 | 82 | < 0.1% | |
| 768 | 70 | < 0.1% | |
| 320 | 51 | < 0.1% | |
| 1024 | 30 | < 0.1% |
| Value | Count | Frequency (%) | |
| 20 | 82 | < 0.1% | |
| 36 | 7309 | 0.7% | |
| 50 | 943356 | 94.3% | |
| 90 | 1851 | 0.2% | |
| 250 | 44712 | 4.5% | |
| 320 | 51 | < 0.1% | |
| 480 | 2539 | 0.3% | |
| 768 | 70 | < 0.1% | |
| 1024 | 30 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1024 | 30 | < 0.1% | |
| 768 | 70 | < 0.1% | |
| 480 | 2539 | 0.3% | |
| 320 | 51 | < 0.1% | |
| 250 | 44712 | 4.5% | |
| 90 | 1851 | 0.2% | |
| 50 | 943356 | 94.3% | |
| 36 | 7309 | 0.7% | |
| 20 | 82 | < 0.1% |
| Distinct count | 416 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2113.1171 |
|---|---|
| Minimum | 112 |
| Maximum | 2757 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 112 |
|---|---|
| 5-th percentile | 547 |
| Q1 | 1863 |
| median | 2323 |
| Q3 | 2526 |
| 95-th percentile | 2691 |
| Maximum | 2757 |
| Range | 2645 |
| Interquartile range (IQR) | 663 |
Descriptive statistics
| Standard deviation | 608.8224622 |
|---|---|
| Coefficient of variation (CV) | 0.2881158182 |
| Kurtosis | 2.269419741 |
| Mean | 2113.1171 |
| Median Absolute Deviation (MAD) | 301 |
| Skewness | -1.639087222 |
| Sum | 2113117100 |
| Variance | 370664.7905 |
| Value | Count | Frequency (%) | |
| 1722 | 111250 | 11.1% | |
| 2424 | 37935 | 3.8% | |
| 2227 | 36677 | 3.7% | |
| 1800 | 29581 | 3.0% | |
| 423 | 23384 | 2.3% | |
| 2480 | 22957 | 2.3% | |
| 2502 | 21193 | 2.1% | |
| 2528 | 20539 | 2.1% | |
| 2506 | 19690 | 2.0% | |
| 2374 | 18531 | 1.9% | |
| 2545 | 17650 | 1.8% | |
| 1872 | 17200 | 1.7% | |
| 1994 | 15121 | 1.5% | |
| 2526 | 14693 | 1.5% | |
| 2299 | 14410 | 1.4% | |
| 1863 | 14125 | 1.4% | |
| 761 | 13835 | 1.4% | |
| 2333 | 12672 | 1.3% | |
| 1993 | 12141 | 1.2% | |
| 2665 | 11862 | 1.2% | |
| 2676 | 11816 | 1.2% | |
| 1873 | 11574 | 1.2% | |
| 2507 | 10896 | 1.1% | |
| 2726 | 10166 | 1.0% | |
| 2566 | 8980 | 0.9% | |
| Other values (391) | 461122 | 46.1% |
| Value | Count | Frequency (%) | |
| 112 | 5974 | 0.6% | |
| 122 | 5712 | 0.6% | |
| 153 | 324 | < 0.1% | |
| 178 | 6255 | 0.6% | |
| 196 | 105 | < 0.1% | |
| 394 | 432 | < 0.1% | |
| 423 | 23384 | 2.3% | |
| 479 | 5014 | 0.5% | |
| 544 | 2498 | 0.2% | |
| 547 | 3703 | 0.4% |
| Value | Count | Frequency (%) | |
| 2757 | 65 | < 0.1% | |
| 2756 | 263 | < 0.1% | |
| 2755 | 1287 | 0.1% | |
| 2754 | 1523 | 0.2% | |
| 2753 | 101 | < 0.1% | |
| 2749 | 849 | 0.1% | |
| 2748 | 876 | 0.1% | |
| 2747 | 2161 | 0.2% | |
| 2745 | 108 | < 0.1% | |
| 2743 | 123 | < 0.1% |
F
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 0 | |
|---|---|
| 3 | |
| 2 | |
| 1 | 67423 |
| Value | Count | Frequency (%) | |
| 0 | 418976 | 41.9% | |
| 3 | 337978 | 33.8% | |
| 2 | 175623 | 17.6% | |
| 1 | 67423 | 6.7% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 418976 | 41.9% | |
| 3 | 337978 | 33.8% | |
| 2 | 175623 | 17.6% | |
| 1 | 67423 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 1000000 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 418976 | 41.9% | |
| 3 | 337978 | 33.8% | |
| 2 | 175623 | 17.6% | |
| 1 | 67423 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 1000000 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 418976 | 41.9% | |
| 3 | 337978 | 33.8% | |
| 2 | 175623 | 17.6% | |
| 1 | 67423 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 418976 | 41.9% | |
| 3 | 337978 | 33.8% | |
| 2 | 175623 | 17.6% | |
| 1 | 67423 | 6.7% |
G
Real number (ℝ≥0)
| Distinct count | 66 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 226.9311 |
|---|---|
| Minimum | 33 |
| Maximum | 1839 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 33 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 35 |
| median | 39 |
| Q3 | 171 |
| 95-th percentile | 1063 |
| Maximum | 1839 |
| Range | 1806 |
| Interquartile range (IQR) | 136 |
Descriptive statistics
| Standard deviation | 350.4800993 |
|---|---|
| Coefficient of variation (CV) | 1.544433968 |
| Kurtosis | 3.857232288 |
| Mean | 226.9311 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 2.157856948 |
| Sum | 226931100 |
| Variance | 122836.3 |
| Value | Count | Frequency (%) | |
| 35 | 300039 | 30.0% | |
| 39 | 218697 | 21.9% | |
| 167 | 77933 | 7.8% | |
| 161 | 39260 | 3.9% | |
| 47 | 36000 | 3.6% | |
| 1327 | 26959 | 2.7% | |
| 297 | 25272 | 2.5% | |
| 163 | 22969 | 2.3% | |
| 175 | 20110 | 2.0% | |
| 679 | 18311 | 1.8% | |
| 935 | 17501 | 1.8% | |
| 687 | 13833 | 1.4% | |
| 41 | 12812 | 1.3% | |
| 1063 | 12776 | 1.3% | |
| 33 | 11757 | 1.2% | |
| 431 | 10611 | 1.1% | |
| 803 | 10166 | 1.0% | |
| 1319 | 9606 | 1.0% | |
| 419 | 8107 | 0.8% | |
| 303 | 7988 | 0.8% | |
| 171 | 7563 | 0.8% | |
| 169 | 7315 | 0.7% | |
| 299 | 7312 | 0.7% | |
| 427 | 6748 | 0.7% | |
| 34 | 6684 | 0.7% | |
| Other values (41) | 63671 | 6.4% |
| Value | Count | Frequency (%) | |
| 33 | 11757 | 1.2% | |
| 34 | 6684 | 0.7% | |
| 35 | 300039 | 30.0% | |
| 38 | 4510 | 0.5% | |
| 39 | 218697 | 21.9% | |
| 41 | 12812 | 1.3% | |
| 43 | 5277 | 0.5% | |
| 45 | 81 | < 0.1% | |
| 47 | 36000 | 3.6% | |
| 161 | 39260 | 3.9% |
| Value | Count | Frequency (%) | |
| 1839 | 276 | < 0.1% | |
| 1835 | 467 | < 0.1% | |
| 1831 | 595 | 0.1% | |
| 1711 | 2336 | 0.2% | |
| 1583 | 92 | < 0.1% | |
| 1575 | 496 | < 0.1% | |
| 1451 | 3263 | 0.3% | |
| 1447 | 3 | < 0.1% | |
| 1327 | 26959 | 2.7% | |
| 1319 | 9606 | 1.0% |
H
Real number (ℝ)
| Distinct count | 164 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 53279.713029 |
|---|---|
| Minimum | -1 |
| Maximum | 100248 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | 100049 |
| Q3 | 100093 |
| 95-th percentile | 100190 |
| Maximum | 100248 |
| Range | 100249 |
| Interquartile range (IQR) | 100094 |
Descriptive statistics
| Standard deviation | 49952.76706 |
|---|---|
| Coefficient of variation (CV) | 0.9375569842 |
| Kurtosis | -1.983339436 |
| Mean | 53279.71303 |
| Median Absolute Deviation (MAD) | 144 |
| Skewness | -0.129082552 |
| Sum | 5.327971303e+10 |
| Variance | 2495278937 |
| Value | Count | Frequency (%) | |
| -1 | 467796 | 46.8% | |
| 100084 | 60157 | 6.0% | |
| 100148 | 44125 | 4.4% | |
| 100111 | 42740 | 4.3% | |
| 100077 | 39227 | 3.9% | |
| 100075 | 38613 | 3.9% | |
| 100081 | 33144 | 3.3% | |
| 100083 | 26559 | 2.7% | |
| 100156 | 25462 | 2.5% | |
| 100193 | 17446 | 1.7% | |
| 100176 | 16090 | 1.6% | |
| 100074 | 14560 | 1.5% | |
| 100079 | 14011 | 1.4% | |
| 100189 | 11579 | 1.2% | |
| 100076 | 11368 | 1.1% | |
| 100192 | 5963 | 0.6% | |
| 100190 | 5723 | 0.6% | |
| 100191 | 5541 | 0.6% | |
| 100188 | 5413 | 0.5% | |
| 100013 | 5000 | 0.5% | |
| 100031 | 4558 | 0.5% | |
| 100155 | 3885 | 0.4% | |
| 100194 | 3668 | 0.4% | |
| 100181 | 3608 | 0.4% | |
| 100000 | 3523 | 0.4% | |
| Other values (139) | 90241 | 9.0% |
| Value | Count | Frequency (%) | |
| -1 | 467796 | 46.8% | |
| 100000 | 3523 | 0.4% | |
| 100001 | 199 | < 0.1% | |
| 100002 | 180 | < 0.1% | |
| 100003 | 2859 | 0.3% | |
| 100004 | 2128 | 0.2% | |
| 100005 | 1707 | 0.2% | |
| 100006 | 1 | < 0.1% | |
| 100010 | 30 | < 0.1% | |
| 100012 | 357 | < 0.1% |
| Value | Count | Frequency (%) | |
| 100248 | 345 | < 0.1% | |
| 100244 | 24 | < 0.1% | |
| 100241 | 596 | 0.1% | |
| 100233 | 2930 | 0.3% | |
| 100229 | 51 | < 0.1% | |
| 100228 | 1596 | 0.2% | |
| 100225 | 174 | < 0.1% | |
| 100224 | 45 | < 0.1% | |
| 100221 | 1841 | 0.2% | |
| 100217 | 1104 | 0.1% |
I
Real number (ℝ≥0)
| Distinct count | 60 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 83.386604 |
|---|---|
| Minimum | 1 |
| Maximum | 255 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 23 |
| median | 61 |
| Q3 | 101 |
| 95-th percentile | 221 |
| Maximum | 255 |
| Range | 254 |
| Interquartile range (IQR) | 78 |
Descriptive statistics
| Standard deviation | 70.30124804 |
|---|---|
| Coefficient of variation (CV) | 0.8430760418 |
| Kurtosis | -0.2603528011 |
| Mean | 83.386604 |
| Median Absolute Deviation (MAD) | 38 |
| Skewness | 1.092674088 |
| Sum | 83386604 |
| Variance | 4942.265476 |
| Value | Count | Frequency (%) | |
| 23 | 219749 | 22.0% | |
| 221 | 125152 | 12.5% | |
| 79 | 113850 | 11.4% | |
| 48 | 53691 | 5.4% | |
| 71 | 52219 | 5.2% | |
| 61 | 51143 | 5.1% | |
| 157 | 45519 | 4.6% | |
| 32 | 43948 | 4.4% | |
| 33 | 37083 | 3.7% | |
| 52 | 29529 | 3.0% | |
| 42 | 25210 | 2.5% | |
| 51 | 21276 | 2.1% | |
| 15 | 18848 | 1.9% | |
| 212 | 16394 | 1.6% | |
| 43 | 14632 | 1.5% | |
| 117 | 10236 | 1.0% | |
| 229 | 10166 | 1.0% | |
| 13 | 9554 | 1.0% | |
| 16 | 8726 | 0.9% | |
| 156 | 8266 | 0.8% | |
| 68 | 8051 | 0.8% | |
| 159 | 7278 | 0.7% | |
| 95 | 6877 | 0.7% | |
| 46 | 5767 | 0.6% | |
| 246 | 4908 | 0.5% | |
| Other values (35) | 51928 | 5.2% |
| Value | Count | Frequency (%) | |
| 1 | 81 | < 0.1% | |
| 13 | 9554 | 1.0% | |
| 15 | 18848 | 1.9% | |
| 16 | 8726 | 0.9% | |
| 17 | 4115 | 0.4% | |
| 20 | 324 | < 0.1% | |
| 23 | 219749 | 22.0% | |
| 32 | 43948 | 4.4% | |
| 33 | 37083 | 3.7% | |
| 35 | 1168 | 0.1% |
| Value | Count | Frequency (%) | |
| 255 | 108 | < 0.1% | |
| 253 | 1921 | 0.2% | |
| 251 | 523 | 0.1% | |
| 246 | 4908 | 0.5% | |
| 229 | 10166 | 1.0% | |
| 221 | 125152 | 12.5% | |
| 219 | 37 | < 0.1% | |
| 212 | 16394 | 1.6% | |
| 204 | 2301 | 0.2% | |
| 195 | 96 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| df_index | like | hour | sid | sdomain | scat | aid | adomain | acat | did | dip | dmodel | dtype | dconn | pos | A | B | C | D | E | F | G | H | I | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 5171432 | 1 | 19122206 | 85f751fd | c4e18dd6 | 50e219e0 | fb7c70a3 | d9b5648e | 0f2161f8 | 337bf809 | 56d6c8a9 | aad45b01 | 1 | 0 | 0 | 1005 | 21747 | 320 | 50 | 2504 | 3 | 41 | 100160 | 111 |
| 1 | 24755502 | 0 | 19122707 | d6137915 | bb1ef334 | f028772b | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | fe02cd8a | 8a4875bd | 1 | 0 | 0 | 1005 | 20213 | 320 | 50 | 2316 | 0 | 167 | 100081 | 16 |
| 2 | 22223641 | 0 | 19122613 | e59ef3fc | 0a4015b2 | 335d28a8 | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | fa56a0ec | 853073be | 1 | 0 | 1 | 1005 | 19771 | 320 | 50 | 2227 | 0 | 935 | 100079 | 48 |
| 3 | 32458364 | 0 | 19122900 | 85f751fd | c4e18dd6 | 50e219e0 | e2fcccd2 | 5c5a694b | 0f2161f8 | a3215618 | 9a10a7eb | be74e6fe | 1 | 0 | 0 | 1005 | 4687 | 320 | 50 | 423 | 2 | 39 | 100148 | 32 |
| 4 | 6027438 | 1 | 19122209 | 4bf5bbe2 | 6b560cc1 | 28905ebd | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | e55e55a1 | 836d2439 | 1 | 0 | 0 | 1005 | 21789 | 320 | 50 | 2512 | 2 | 303 | -1 | 52 |
| 5 | 10310046 | 0 | 19122305 | b7e9786d | b12b9f85 | f028772b | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | 9678ccab | a0f5f879 | 1 | 0 | 1 | 1005 | 16208 | 320 | 50 | 1800 | 3 | 167 | 100077 | 23 |
| 6 | 36769183 | 0 | 19123004 | 85f751fd | c4e18dd6 | 50e219e0 | 66f5e02e | 6f7ca2ba | 0f2161f8 | 79bc0e4f | c72bfab0 | 2891f384 | 1 | 0 | 0 | 1005 | 23804 | 320 | 50 | 2726 | 3 | 803 | 100091 | 229 |
| 7 | 18757124 | 0 | 19122515 | 6c5b482c | 7687a86e | 3e814130 | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | c782a684 | 84ebbcd4 | 1 | 0 | 0 | 1005 | 17654 | 300 | 250 | 1994 | 2 | 39 | -1 | 33 |
| 8 | 18219411 | 0 | 19122512 | 1fbe01fe | f3845767 | 28905ebd | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | 6f669b29 | cdf6ea96 | 1 | 0 | 0 | 1005 | 15708 | 320 | 50 | 1722 | 0 | 35 | -1 | 79 |
| 9 | 40086614 | 0 | 19123020 | 85f751fd | c4e18dd6 | 50e219e0 | f0d41ff1 | 2347f47a | 0f2161f8 | a99f214a | baa7e549 | 2de871e6 | 1 | 0 | 0 | 1005 | 24040 | 320 | 50 | 2756 | 3 | 299 | 100112 | 61 |
Last rows
| df_index | like | hour | sid | sdomain | scat | aid | adomain | acat | did | dip | dmodel | dtype | dconn | pos | A | B | C | D | E | F | G | H | I | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 999990 | 11489892 | 0 | 19122311 | b7e9786d | b12b9f85 | f028772b | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | b3bb4959 | 5ec45883 | 1 | 0 | 1 | 1005 | 22120 | 320 | 50 | 1702 | 0 | 1059 | 100079 | 110 |
| 999991 | 8001580 | 0 | 19122214 | a7853007 | 7e091613 | f028772b | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | 009a7861 | 3bb1ddd7 | 1 | 0 | 1 | 1005 | 9478 | 320 | 50 | 906 | 3 | 1451 | 100156 | 61 |
| 999992 | 23753652 | 0 | 19122622 | 85f751fd | c4e18dd6 | 50e219e0 | ce183bbd | ae637522 | cef3e649 | a99f214a | d8b9fb64 | 36b67a2a | 1 | 0 | 0 | 1005 | 22516 | 320 | 50 | 2597 | 1 | 167 | 100005 | 71 |
| 999993 | 20843974 | 0 | 19122606 | 5114c672 | 3f2f3819 | 3e814130 | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | e2114742 | 4ea23a13 | 1 | 0 | 1 | 1005 | 19771 | 320 | 50 | 2227 | 0 | 679 | -1 | 48 |
| 999994 | 11126076 | 0 | 19122309 | 85f751fd | c4e18dd6 | 50e219e0 | 54c5d545 | 2347f47a | 0f2161f8 | a99f214a | b6d940b0 | d787e91b | 1 | 0 | 0 | 1005 | 15702 | 320 | 50 | 1722 | 0 | 35 | -1 | 79 |
| 999995 | 14043529 | 0 | 19122405 | e151e245 | 7e091613 | f028772b | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | 30e8f0b7 | 1f0bc64f | 1 | 0 | 1 | 1005 | 21679 | 320 | 50 | 2495 | 2 | 167 | 100173 | 23 |
| 999996 | 21948821 | 0 | 19122612 | 85f751fd | c4e18dd6 | 50e219e0 | 92f5800b | ae637522 | 0f2161f8 | a99f214a | 777a32ec | 496515fa | 1 | 3 | 0 | 1005 | 21189 | 320 | 50 | 2424 | 1 | 161 | 100193 | 71 |
| 999997 | 533594 | 0 | 19122103 | 1fbe01fe | f3845767 | 28905ebd | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | d8a5f6c9 | 711ee120 | 1 | 0 | 0 | 1005 | 15706 | 320 | 50 | 1722 | 0 | 35 | 100084 | 79 |
| 999998 | 18990869 | 0 | 19122516 | 1fbe01fe | f3845767 | 28905ebd | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | ed10de3f | 8a4875bd | 1 | 0 | 0 | 1005 | 15701 | 320 | 50 | 1722 | 0 | 35 | 100084 | 79 |
| 999999 | 24579642 | 0 | 19122706 | 1fbe01fe | f3845767 | 28905ebd | ecad2386 | 7801e8d9 | 07d7df22 | a99f214a | dd570a33 | a8d2c4cf | 1 | 0 | 0 | 1005 | 22676 | 320 | 50 | 2616 | 0 | 35 | 100083 | 51 |